One of the challenges in Speech Emotion Recognition (SER) "in the wild" is the large mismatch between training and test data (e.g. speakers and tasks). In order to improve the generalisation capabilities of the emotion models, we propose to use Multi-Task Learning (MTL) with gender and naturalness as auxiliary tasks in deep neural networks. This method was evaluated in within-corpus and various cross-corpus classification experiments that simulate conditions "in the wild". In comparison to state-of-the-art Single-Task Learning (STL) methods, our proposed MTL method improved performance significantly. In particular, models using both gender and naturalness achieved larger gains than those using either gender or naturalness alone. This benefit was also visible in the high-level representations of the feature space obtained from our proposed method, in which discriminative emotional clusters could be observed.
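The general shape of such an MTL setup, a shared trunk feeding one main (emotion) head and two auxiliary heads (gender, naturalness), can be sketched as below. This is a minimal illustrative forward pass in NumPy, not the paper's actual architecture: all dimensions, the number of emotion classes, and the auxiliary loss weight `aux_weight` are hypothetical placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: acoustic feature vector -> shared hidden layer.
n_features, n_hidden = 40, 64
n_emotions = 4  # placeholder number of emotion classes

# Shared trunk weights: the representation all three tasks use.
W_shared = rng.normal(scale=0.1, size=(n_features, n_hidden))

# Task-specific heads: main task (emotion) plus the two auxiliary tasks.
W_emotion = rng.normal(scale=0.1, size=(n_hidden, n_emotions))
W_gender  = rng.normal(scale=0.1, size=(n_hidden, 2))  # e.g. female/male
W_natural = rng.normal(scale=0.1, size=(n_hidden, 2))  # e.g. acted/natural

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def forward(x):
    # One shared nonlinear representation, three task-specific softmax heads.
    h = np.tanh(x @ W_shared)
    return (softmax(h @ W_emotion),
            softmax(h @ W_gender),
            softmax(h @ W_natural))

def mtl_loss(probs, targets, aux_weight=0.5):
    # Cross-entropy on the main task plus down-weighted auxiliary losses.
    def ce(p, t):
        return -np.log(p[np.arange(len(t)), t] + 1e-12).mean()
    (p_e, p_g, p_n), (t_e, t_g, t_n) = probs, targets
    return ce(p_e, t_e) + aux_weight * (ce(p_g, t_g) + ce(p_n, t_n))

# Toy batch of 8 utterance-level feature vectors with dummy labels.
x = rng.normal(size=(8, n_features))
probs = forward(x)
loss = mtl_loss(probs, (np.zeros(8, int), np.ones(8, int), np.zeros(8, int)))
```

The intuition behind the design is that gradients from the auxiliary heads regularise the shared trunk, encouraging representations that encode speaker and corpus attributes rather than overfitting the training speakers.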